Picture for Li Su

Li Su

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

Add code
Mar 25, 2025
Viaarxiv icon

Computational Analysis of Yaredawi YeZema Silt in Ethiopian Orthodox Tewahedo Church Chants

Add code
Dec 25, 2024
Viaarxiv icon

Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants

Add code
Dec 25, 2024
Figure 1 for Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Figure 2 for Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Figure 3 for Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Figure 4 for Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Viaarxiv icon

Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning

Add code
Dec 18, 2024
Viaarxiv icon

Distortion Recovery: A Two-Stage Method for Guitar Effect Removal

Add code
Jul 23, 2024
Figure 1 for Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Figure 2 for Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Figure 3 for Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Figure 4 for Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Viaarxiv icon

Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning

Add code
Jul 16, 2024
Figure 1 for Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Figure 2 for Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Figure 3 for Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Figure 4 for Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Viaarxiv icon

A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons

Add code
Jun 26, 2024
Figure 1 for A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons
Figure 2 for A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons
Figure 3 for A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons
Figure 4 for A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons
Viaarxiv icon

MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing

Add code
Jun 10, 2024
Figure 1 for MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Figure 2 for MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Figure 3 for MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Figure 4 for MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Viaarxiv icon

Context-aware Difference Distilling for Multi-change Captioning

Add code
May 31, 2024
Viaarxiv icon

BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer

Add code
Jan 05, 2024
Viaarxiv icon